HMM Specialization with Selective Lexicalization

نویسندگان

  • Jin-Dong Kim
  • Sang-Zoo Lee
  • Hae-Chang Rim
چکیده

We present a technique which complements Hidden Markov Models by incorporating some lexicalized states representing syntactically uncommon words. Our approach examines the distribution of transitions, selects the uncommon words, and makes lexicalized states for the words. We performed a part-of-speech tagging experiment on the Brown corpus to evaluate the resultant language model and discovered that this technique improved the tagging accuracy by 0.21% at the 95% level of confidence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

L1 Glossing and Lexical Inferencing: Evaluation of the Overarching Issue of L1 Lexicalization

This empirical study reports on a cross-linguistic analysis of the overarching issue of L1 lexicalization regarding two (non)-interventionist approaches to vocabulary teaching. Participants were seventy four juniors at the Islamic Azad University, Roudehen Branch in Tehran. The investigation pursued (i) the impact of the provided (non)-interventionist treatments on both sets of (non)-lexicalize...

متن کامل

A New Strategy for Improving Feature Sets in a Discrete Hmm-based Handwriting Recognition System

In this paper we introduce a new strategy for improving a discrete HMM-based handwriting recognition system, by integrating several information sources from specialized feature sets. For a given system, the basic idea is to keep the most discriminative features, and to replace the others with new ones obtained from new feature spaces. After evaluating the individual discriminative power of each...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

The Impact of Metalinguistic English Vocabulary Knowledge and Lexical Inferencing on EFL Learners’ Lexical Knowledge Considering the Cross-Linguistic Issue of L1 Lexicalization

The present study endeavors to unravel the enigma of the psycholinguistic mechanisms underpinning bilingual mental lexicon by analyzing the issue of L1 lexicalization as a construct epitomizing an overarching framework. It involves 78 juniors at the Islamic Azad University, Roudehen Branch. The study inspects the impact of the interventionist/noninterventionist treatments on both sets of lexica...

متن کامل

Lexicalization vs. Vocalization: A Cross-Linguistic Study of Emphasis in English and Persian

Language is a system of verbal elements that makes communication of meaningspossible in the manners the users intend by employing certain linguistic deviceswhich are partly language-specific. Once communicating cross-linguistically, thereis always a risk of negative transfer of techniques or processes from the firstlanguage (L1) to the foreign language (L2). The current study investigates the“e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.CL/9912016  شماره 

صفحات  -

تاریخ انتشار 1999